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Abstract 


In this thesis, we propose the following : 

1. Simpler algorithms for the Symmetric Angle Restricted Nearest Neighbour 
(SARNN) problem in the L\ and L ra metrics. 

2. Simpler algorithms for the Angle Restricted Nearest Neighbour (ARNN) prob- 
lem in the L x and metrics. 

3. An optimal 0(n log n) algorithm for the ARNN problem in any L p metric. 

The algorithms in 1 and 2, above, use the plane sweep technique. The previous 
optimal algorithm for the SARNN problem by Aggarwal and Wee [2] used divide and 
conquer and the concept of finding the row minimas in a totally monotone matrix, 
to get the 0(n log n) complexity. 

In addition, parallelization of the SARNN problem using the totally monotone 
matrix approach is discussed. 
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Preliminaries 


The Angle-Restricted Nearest Neighbour(ARNN) problem and the related problem 
of finding the Symmetric ARNN(SARNN) graph have applications in wire routing 
on integrated circuits [6] and in other proximity problems like the relative neighbour- 
hood graph [11]. In this thesis, we propose some algorithms to solve the ARNN and 
the SARNN problems in various metrics. 

1.1 Definitions 

First, we give some definitions which will be useful in the rest of the report. 

1.1.1 ARNN and SARNN 

Given a set S = {pi,p 2 , ...,p n } of n points in the Euclidean plane, we partition the 
plane around each point into k angular regions where A; is a constant. 

Definition 1.1 For any particular region, we define the angle-restricted nearest 
neighbour (ARNN) of a point Pi to be a point in the given set that lies in this region 
and that is the closest to pi in the L p metric. 

Definition 1.2 Two points, p and q, are symmetric angle-restricted nearest neigh- 
bours(SARNNs) of each other if p is an angle-restricted neighbour of q in one of its 
regions and vice-versa. 
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Definition 1.3 The ARNN graph of S has the points of S as vertices and a directed 
edge from vertex p to vertex q, if and only if q is the ARNN of p in one of its angular 
regions. 

Definition 1.4 The SARNN graph of S has the points of S as vertices and an edge 
between two vertices if and only if the corresponding points are SARNNs of each 
other. 

The ARNN problem consists of finding the ARNN graph of a set of points, S and 
the SARNN problem consists of finding the SARNN graph of a set of points, S. 



1.1.2 L p Metrics 

The definitions in this subsection are standard and have been taken, almost verba- 
tim, from the web-site 1 [8]. 

Definition 1.5 The L p metric over M n for any p > 1 is given by equation 1 

p(x,x') = 

For each value of p, equation 1 is called an L p metric. The three most common 
metrics are Li, L 2 and Loo. 

1 http://msl. cs. uiuc. edu/planning/ nodel 77. html 


£ 

.i=l 




-x'\ p 




( 1 ) 
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Definition 1.6 L\ : The Manhattan metric , which is often nicknamed this way 
because in R 2 it corresponds to the length of a path that is obtained by moving 
along an axis-aligned grid. Taking p = 1 in Equation 1 , for two. points p(x,y ) and 
q(x',y'), we get DIST Ll (p,q ) = \x - x'\ + \y - y'\. 

Example The I^-distance from (0,0) to (2,5) is 7 by traveling “east two blocks” 
and then “north five blocks” I 

Definition 1.7 L 2 : The Euclidean metric , which is the familiar Euclidean distance 
in R n . 

Definition 1.8 L 0 0 : The metric must actually be defined by taking the limit 
of Equation 1 as p tends to infinity. The result is : 


Loo (x, x') = maxi<i< n \%i ~ a/ f | 


( 2 ) 


1.1.3 Monotonicity 

Definition 1.9 An axray is called monotone if the minimum entry in the i th row 
lies below or to the right of the minimum entry in the (i — l) th row for all 2 > i > n. 

Definition 1.10 An axray is said to be totally monotone if every one of its 2 x 2 
sub-arrays is monotone. 

Definition 1.11 A sub-array of an array A is obtained by deleting one or more 
rows and/or columns of A. 

Example If we have a 5 x 5 array and we delete any one row and any two columns 

from it, we are left with a 4 x 3 sub-array of the original array. | 


1.2 Organisation of the Thesis 

In Chapter 2, we first describe the sequential algorithm for the SARNN problem and 
then provide bounds for the parallelization of the same problem using the totally 
monotone matrix approach. In Chapter 3, we describe two simple algorithms for 
the SARNN (Symmetric Angle Restricted Nearest Neighbour) problem in the Li 
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and Lqo metrics. In Chapter 4, we see how minor modifications to the algorithms 
of Chapter 3 can be made to solve the ARNN(Angle Restricted Nearest Neighbour) 
problem in the L\ and metrics. In Chapter 5, we propose some modifications to 
Aggarwal and Wee’s algorithm [2] for the SARNN problem, to optimally solve the 
ARNN problem for any L p metric, 1 < p < oo, in 0(n logn) time. 
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Chapter 2 

The SARNN problem and its 
parallelization 


2.1 Sequential algorithm to find the SARNN graph 

The sequential bound for the SARNN problem is 0(n log n) which was proved by 
Aggarwal and Wee in [2]. Their algorithm makes use of two techniques : 

1. Divide and conquer and 

2. Finding the row minimas in totally monotone matrices. 

Let us assume that k = 8. Hence, the plane around each point is to be divided 
into eight regions (see Figure 2.1). We pair opposite regions and solve the SARNN 
problem for each of these pairs separately. 

Suppose, for example, we consider the pair of regions EN and WS (see Figure 2.1). 
Then, the sequential algorithm [2] for finding the ENWS SARNN graph of a set of 
points, P, is as follows: 

1. Divide the given n points into roughly equal parts, U and V, by a line of 
slope 1, U and V, where U comprises of points below the dividing line and V 
comprises of points above the dividing line. Also, |£/| = r and \V\ = q. This 
is done by first computing c* = y { — Xi for every point (x it y { ), after which the 
median of the c* s is found (say c). Then, the dividing line will be c = y — x. 




Figure 2.1: The regions around a point (for k = 8) 

2. Compute the ENWS(f7) and ENWS(V) recursively using the same procedure. 

3. A pair of points u* € U and Vj € V are said to form a restricted symmetric pair 
for U and V if Ui and Vj are symmetric angle-restricted nearest neighbours in 
P. We compute the restricted symmetric pairs of U and V as follows : 

(a) Convert this problem into that of finding the minimum entry of each row 
of a totally monotone matrix. 

This is done by making the i th row represent the i th point of 
U and the j th column represent the j th point of V. Every en- 
try of the matrix represents the L p distance between the point 
represented by its row and the point represented by its column. 

The matrix formed will be totally monotone. Here, we note that the 
entire matrix does not need to be computed in advance. Every entry of 
the matrix can be computed, on the fly, by a single processor in constant 
time. 

(b) We find the row minimas of this matrix using the algorithm of [1]. Simi- 
larly, we find the column minimas. 

(c) If an entry of the matrix is the minimum in its row and in its column 
then the point representing the row and the point representing the column 
form a restricted symmetric pair. 
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Figure 2.2: Restricted Symmetric pairs (joined by dark lines) 


2.1.1 Analysis 

The recurrence relation is : T(n) = 2T(n/2 ) + D(r,q ) 

where D(r , q ) is the time to compute the restricted symmetric pairs of U and V. 
As the algorithm of [1] takes O(n) time for computing D(r,q), we have T(n ) = 
O(nlogn). Also, space taken is O(n) as the matrix does not need to be computed 
in advance. Instead, the entries of the matrix are calculated as and when needed in 
0(1) time each. 


2.2 Parallel bounds for the SARNN problem 

Here, we parallelize the algorithm that was described in the previous section. 
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2.2.1 Parallel Algorithm 


In step 1 of the algorithm, the points are separated into two halves. This is done by 
finding the median of the (y — x)s of all the points. This step takes 0(log n log log n) 
time using a linear number of operations [7]. Another way would be to sort the c*s 
= ( yi — Xj)s in O(logn) time and O(nlogn) operations [7]. This sorting will also be 
useful in the rest of the stages. Next, we solve the problem for the individual groups 
recursively in parallel. The final step is finding the restricted symmetric pairs. This 
involves finding the row minimas of the totally monotone matrix formed. Let us 
assume that this takes /(n) time and n/g(n) processors. Obviously if f(n) = gin) 
then the algorithm is work optimal and if /(n) = g{n) = log(n) then the algorithm 
is both time and work optimal. 


2.2.2 Analysis 


In stage 1 of the algorithm we have an (n/2) x (n/2) monotone matrix. Finding 
the row minimas of this matrix takes /(n) time and uses n/g(n ) processors. Hence, 
the work done in this step is ( n x f(n))/g(n). Similarly, in stage 2 of the algorithm 
we have 2 in /A) x (n/4) matrices and we have to find their row minimas using 
n/(g(n ) x 2) processors each. Hence, totally (n x f(n/2))/g(n/2) work is done in 
this step. This continues till the matrices have a fixed constant number of elements 
in them. In each stage, both sides of the matrix axe halved. Hence, there are 
O(logn) steps in this algorithm. 


Work = n j + 7^777 + ' ' ’ (lognterms) j 
U(n) ain/ 2) g{n/ A) J 

We also know the following :: /(n) > g(n), f(n) > /(n/2) and g(n) > gin/ 2). 

. ZM > /( n / 2 ) > /( n / 4 ) > ... 

gin) ~ gin/ 2) “ g(n/A) 

Work = O fnl H (log n terms)! 

V Uw s(n) J 

Work = O ( n X M f l0gn 

gin) 
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Similarly, we get, 

Time = 0(f(n) x logn) 

Thus, the time taken to compute the SARNN graph in parallelling the monotone 
matrix paradigm and the CREW model) is at most 0(logn)x the time taken by the 
black-box routine. The work complexity is also increased by a factor of O(logn). 

The best known time-optimal parallel algorithm for the row minima in totally 
monotone matrix is from [3] which does it in O(logn) time. Using this, the par- 
allel algorithm for the SARNN problem will take 0(log 2 n) time and 0(nlog 2 n) 
work. Similarly, the best known work-optimal parallel algorithm for the row min- 
ima in totally monotone matrix is from [4] which does it in 0(log nV log n log log n) 
time. Using this, the parallel algorithm for the SARNN problem will take 
0(log 2 ni/log n log log n) time and O (n log riy'log n log log n) work. A theoretical 
time and work optimal parallel algorithm for the row minima problem would take 
O(logn) time and 0(n ) work. Hence the parallel algorithm for the SARNN problem 
would take 0(log 2 n) time and O(nlogn) work, i.e. would be work-optimal, /fussy 

As yet, there is no NC (poly-logarithmic time), work optimal(0(n)) algorithm 
for finding the row minimas in a totally monotone matrix. 


Type 

Model 

Time 

Work 

Reference 

Sequential 

- 

0(n) 

- 

MfiHH 


CRCW 

0(logn log logn) 

0(n\/logn) 


Parallel 

CREW 

0(logn(loglogn) 2 ) 

0(n\/logn) 



EREW 

O(logn) 

0(n logn) 




0(log n-y/log n log log n) 

0(n\J log n log log n) 


Randomized 

EREW 

_0(log n) 1 



Parallel 

CRCW 

6 (log logn) 




Table 2.1: Complexities of various algorithms for finding the row minimas in a 
totally monotone matrix 


1 0(f(n)) denotes that, with high probability, the complexity is 0(f(n)). 
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Chapter 3 

A simple algorithm for the 
SARNN problem 


In this chapter, we present simpler algorithms for the SARNN problem in the L\ 
and Lqo metrics. We use the plane sweep technique of [5]. In the plane sweep data 
structure, we store the points which could be the SARNN for some future event 
point. Any point, which cannot be a SARNN for a future event point, is deleted 
from the data structure. 

Here too, like in [2], we assume that k = 8. i.e. the plane around each point 
is to be divided into eight regions. We pair opposite regions and solve the SARNN 
problem for each of these pairs separately. Suppose, for example, we consider the 
pair of regions EN and WS. First, we solve the problem for the Li metric. 

3.1 SARNN in the L\ metric 

We first sort the points in the direction of the line with slope 1. i.e sort them by 
the sum (y, + re,). Then, we proceed from the point which has greatest value of 
(Vi + Xi), towards the point which has lowest value of (yi + Xi). Each point will now 
be called an event point. The sweep line has slope —1. We assume that the sweep 
line intersects only 1 point at a time i.e. Vi, (xi + yi) is unique. The sweep line 
stops at each event point, does some processing and then moves on to the next event 
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Figure 3.1: Sweep line and data structure at event point P 



Figure 3.2: Sweep line and data structure after processing point P 
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mt. At each event point, we find the set of points (the shaded points in Fig. 3.1) 
the data structure which could be the SARNN for the event point, i.e. which lie 
the EN region of the event point. Assuming that the event point is (x,, yf) and 
i point to be checked is (xj.yj), this can be checked by the following 2 conditions: 

1 - Vi > Vi 

2 - ( Vi ~ x { ) > (yj - xj) 

Each such point(e.g. the shaded points in Fig. 3.2) is deleted from the structure 
d then the Li distance of this point from the event point is found. If, it is less 
an the current known smallest distance of the points seen so far, then this point 
assumed, for the time-being, to be the SARNN of the event point in the EN 
;ion. After all the points in the EN region of the event point have been looked at 
d deleted, the event point is added to the data structure and then the sweep line 
oceeds to the next event point. 

This is done until the sweep line goes past the last point. At this stage, the 
>ints still on the data structure don’t have a neighbour in the WS direction and 
nee, they can be deleted. The correctness of this method follows from Lemma 3.1. 

amrna 3.1 For any point q, which lies in the data structure and is in the EN- 
gion of the event point p, the ARNN of q, in the WS direction, in the L\ metric, 

P- 

roof: Refer to Fig. 3.3. Draw a horizontal line-segment from q to the sweep 

le, which meets the sweep line at point s. Draw another line-segment from q, 
jrpendicular to the sweep line and which meets the sweep line at r. Any point 
ithin the isosceles right-angled A qrs will have an L\ distance from q which is less 
lan the L\ distance between p and q. Suppose that such a point, r, exists. 

We can see that r lies in the WS region of q. Hence, it will be the ARNN of q 
. the WS region. But, r would have been a previous event point and since q lies in 
s EN region, q would have been deleted from the data structure. We know that q 
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s 


q 



Figure 3.3: An illustration of the proof of Lemma 3.1 

is still in the sweep data structure. Hence, the hypothetical point, r, can not exist. 
Hence, p is the ARNN for any such point q in the Zq metric. | 

Because of the above lemma we know the following: 

(a) Only such points, whose ARNNs in the WS direction in the Li metric have been 
found, are deleted. 

(b) The SARNN of the event point in the EN region is found correctly. 

3.2 SARNN in the metric 

For the L 0 0 metric, we have a lemma which is analogous to Lemma 3.1. Here too, 
we assume that the sweep line intersects only one point at a time. 

Lemma 3.2 For any point q, which lies in the data structure and is in the EN 
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m of the event point, p, the ARNN of q, in the WS direction of q, in the L x 
ric, is p. 

of: Refer to Fig. 3.4. Draw a horizontal line-segment from q to the sweep 

meeting it at s. Draw another line-segment from q to the sweep line making 
ngle of 45° at q and which meets the sweep line at r, a point lower than s. Any 
it within the isosceles right-angle A qrs will be in the WS region of q and will 
5 an Lqo distance from q which is less than the Loo between p and q. Suppose 
; such a point, r, exists. 

Hence, r will be the ARNN of q in the WS region. But, r would have been a 
nous event point as the sweep line proceeds from right to left, and since q lies in 
3N region, q would have been deleted from the data structure. We know that q 
dll in the sweep data structure. Hence, the hypothetical point, r, can not exist, 
ice, p is the ARNN of any such point q in the L^ metric. | 



Figure 3.4: An illustration of the proof of Lemma 3.2 

We use the same technique as for the L\ metric except for a few minor changes. 
;re, the points axe sorted according to their X-coordinates. Also, the sweep line 
vertical and it moves horizontally from right to left. For every event point, the 
lints in the sweep data structure, which are also in its EN region are looked at 


14 



and evaluated according to their distance from the event point. We see that all 
the points in the data structure in the EN region of the event point can be deleted 
from it (after checking the L 0 0 distance from the event point) because of Lemma 3.2 
above. The full algorithm, for the sake of completeness, is given in Appendix A. 


3.3 Complexity Analysis 

The points can be sorted in O(ralogn) time in the required direction. At each 
event point we find the lowest point on the data structure which could be the event 
point’s SARNN. This can be found in O(logn) time as we can use a red-black tree 
(see Cormen et al. [5]) to store the sweep data structure. This is done for each of 
the points in the set. Hence this part of the algorithm takes 0(n log n) time. 

Next, we notice that points are added only once to the sweep data structure and 
deleted when their distances to an event point are found. Hence there are 0(n) 
updates to the data structure in total. As each insertion or deletion in a red-black 
tree takes O(logn) time [5], it follows that the overall algorithm takes 0(n log n) 
time. 
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Chapter 4 

A simple algorithm for the ARNN 
problem 


In this chapter, we present simpler algorithms for the ARNN problem in the L\ and 
L 0 o metrics. The techniques and data structures used axe the same as that for the 
SARNN problem. 

Here too, like in [2] , we assume that k = 8. Hence, the plane around each point is 
to be divided into eight regions. However, we don’t pair regions like in the SARNN 
problem. Instead, the graph for each region is found separately and then merged 
later on to get the complete ARNN graph. 


4.1 ARNN in the L\ metric 

The algorithm for the ARNN problem in the L\ metric is not very different from 
the algorithm for the SARNN problem in the L\ metric. The only change in the 
SARNN algorithm is that at each event point, we set the point in the sweep data 
structure which was the closest to the event point in its EN region to be the SARNN 
of the event point. Here, we set the ARNN in the WS direction of all points in the 
EN region of the event point, and in the data structure, to be the event point. All 
such points are deleted from the data structure after this, the event point is then 
added to the data structure and the sweep line proceeds to the next event point. 
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the event point is then added to the data structure and the sweep line proceeds to 
the next event point. The full algorithm for the sake of completeness is given in 
Appendix B. 

4.2 ARNN in the metric 

The algorithm to find the ARNN graph in the metric is the same that for the 
SARNN graph in the metric except for the changes mentioned above. The full 
algorithm for the sake of completeness is given in Appendix C 

4.3 Analysis 

There is not much change in the algorithms from that of the SARNN graph. We 
look at each of the points in the data structure once, immediately set its ARNN 
in the WS region and then delete it. Hence, the data structure is updated 0(n ) 
times. As each update to the data structure takes O(logn) time [5], the complexity 
remains O(nlogn). 
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Chapter 5 

An optimal algorithm for the 
ARNN problem 


Guibas and Stolfi [6] describe a divide and conquer algorithm to solve the Angle- 
Restricted Nearest Neighbour (ARNN) problem that takes O(nlogn) time for the L\ 
and Lqo metrics. For the general case of the L p metric where p > 1, Wee et al. 
[11] describe a divide and conquer algorithm that takes 0(nlog 2 n) time. Here, we 
present an algorithm to solve the ARNN problem in the L p metric for p > 1 in 
O(nlogn) time. 

5.1 The Algorithm 

The algorithm that we use for the ARNN problem is almost the same as that used 
by Aggarwal and Wee [2] for the SARNN problem except for some changes which 
are described below. First, we compute the L\ nearest neighbour for each point in 
all the regions. This we can do by using the method in [6] or by using the algorithm 
proposed in section 4.1 earlier. 

We compute the ARNN graph for each region separately. Here, we compute the 
ARNN graph of the set of points, S for only the West-South (WS) region, which we 
shall denote by WS(S). It is easy to see that if an algorithm finds the ARNNs of 
all points in the WS region then by suitable rotation of the axes, we can find the 
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ARNNs of all the points in all of the regions and hence obtain the complete ARNN 
graph. 

We have a set S of points for which we need to find the ARNN graph. Let l 
denote a line with slope 1, that splits the points in S in two roughly equal parts, 
U = {ui, ••• ,u r } and V = {iq, • • • ,u ? }, the points of which occur in increasing 
y-coordinates, the points of U lie to the right of l and the points of V lie to its 
left. Now to compute WS(S). We individually compute WS(U), WS(V) and the 
restricted pairs for U and V. Hence, we need to compute all the restricted pairs of 
U and V. 

Definition 5.1 A pair of points, Ui € U and Vj G V form a restricted pair if Vj is 
the angle-restricted nearest neighbour of U{ in the WS region. 

To compute the restricted pairs, we transform this problem into that of finding 
the row minimas in a totally monotone matrix, in linear time. Now, since finding the 
row minimas in a totally monotone matrix also takes linear time using the algorithm 
in [1], we can solve the overall problem in 0(n log n) time which is optimal. 

Definition 5.2 A point v is dominated by a point u if v lies in the West-South 
octant of u. In other words, if the coordinates of u and v are (x u , y u ) and (x v , y v ) 
respectively, then y u > y v and (y u — x u ) > (y v — x v ). 

Lemma 5.1 Let Ui , Uj G U be any two points such that Uj dominates Ui, and let Vk 
he any point in V whose y- coordinate is no higher that that of Ui. Then, Vk cannot 
be the restricted nearest neighbour of Uj among all points in S. 

Proof: See proof of Lemma 2.1 in [2]. I 

Let the L p nearest neighbour of a point r be the point closest to r if the L p metric 
is used to compute the distances. 

Lemma 5.2 If the L\ nearest West-South neighbour of a point u € U belongs to U , 
then the L p nearest West-South neighbour of u also belongs to U, for any p > 1. 
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Proof: See proof of Lemma 2.2 in [2]. | 

Let U' = {ut E U\ the L x nearest West-South neighbour of u { belongs to V} 
and let RP(U,V) denote the restricted pairs for U and V. Then from Lemma 5.2, it 
follows that RP(U, V) = RP(U’, V). Hence, from now on, we compute RP(U’, V) 
instead of RP(U, V). 

Lemma 5.3 Let u {l Uj , u t e U' such that Uj dominates u, and the y-coordinate ofui 
is no less than that of Uj . And, let v k be any point in V whose y-coordinate is less 
than that of u x . Then, ui cannot be the L p nearest neighbour of v k in the EN region. 

Proof: See proof of Lemma 2.3 in [2]. | 

5.1.1 Transformation 

Next, we transform the problem of finding the restricted pairs into that of finding 
the row minimas in a totally monotone matrix. We first note that the points of 
U' = {ui, • • • , ti r ) and of V = {v x , • • • ,v q } occur in the increasing order of their 
y-coordinates. The next two definitions are taken from [2]. 

Definition 5.3 For 1 < i < q, the top anchor of u,, denoted by top(ui), is defined 
to be the index of the highest point of V that belongs to the West-South octant of 
Uj. Formally, top(o) = b if r/b = max { yj \ yj < y a and j € V}. 

Definition 5.4 For 1 < i < q the bottom anchor of u x , denoted by bot(ui), is defined 
as follows : 

{ bot(ui-i ) if Ui does not dominate tij_i 
top(ui-i) otherwise 

We can find the top and bottom anchors for all points in U' in 0(r + q ) time [2]. 

Lemma 5.4 For 1 < i < r andl < j < q , can be the restricted nearest neighbour 
of Vj only if 

(a) j < top(ui) and 


20 



(b) j > bot(ui). 

Proof: If j > top(ui ) then v k does not lie in the West-South octant of u t . 

Inequality (b) follows from Lemmas 5.1 and 5.3. | 

Now, we use the above lemma in constructing the totally monotone matrix. We 
consider a rxq matrix, M = such that for 1 < i < r and 1 < j < 9j m(ij) 

equals the L p distance, p > 1, between u* and Vj if both inequalities (a) and (b) of 
Lemma 5.4 hold, and m(i,j ) = oo otherwise. Then, for 1 < i < r and 1 < j < q, 
(' u i,Vj ) can form a restricted pair for U' and V only if m(i,j ) is the smallest entry 
in the i th row. 

Lemma 5.5 For 1 < i < r, let s(i ) be defined such that M(fs(i )) contains the 
minimum entry in the i th row of M. Then, VI < i < r, s(i) can be computed in 
0(r + q) time. 

Proof: See proof of Lemma 3.2 of [2] . In the proof of Lemma 3.2 of [2], Aggarwal 

and Wee show that the matrix M discussed above is totally monotone and that by 
using the algorithm in [1] we can find the row minimas in 0(r + q ) time. | 

After finding the minimum entry in each row of M, say m(i,j) is such an entry, 
we check whether m(i,j) is less than the distance between iq and its West-South 
nearest neighbour in JJ. If so, then (ui,vf) is a restricted pair for U and V. 


5.2 Analysis 

Let T(n) denote the time to compute WS(S) for the n given points. Also, let D(r, q ) 
denote the time to compute all restricted pairs for U and V . It can be verified that 
T(n) < 2T(n/2) + D(r, q), and T(2) = c. Now in computing the restricted pairs, 
we convert it to a totally monotone matrix. Here we note that the entire matrix 
need not be computed and stored in advance. The matrix elements are actually 
distances between two points and can be computed on the fly in 0(1) time each. 
Also computing the row minimas takes 0(r + q ) time [1]. The operations in the end, 
of checking the WS nearest neighbour in U, do not take more than 0(r + q) time. 
Hence D(r, q) = 0(r + q) = 0(n). Therefore, we have the following : 
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Theorem 5.6 Given a set S of planar points, in the L v metric, 1 < p < oo, 
ARNN(S) can be computed in 0(n log n) time. 



Chapter 6 
Conclusions 


We propose algorithms to solve the SARNN problem optimally in 0(n log n) time 
in the L x and the metrics using the plane sweep technique. We show that with 
slight modifications, the above algorithms can be used to solve the ARNN problem 
in the L x and the L Q 0 metrics also in 0(n log n) time. 

We also propose some modifications to Aggarwal and Wee’s algorithm for the 
SARNN problem to optimally solve the ARNN problem for any L p metric, 1 < p < 
oo, in 0(n log n) time. The earlier algorithm of [6] takes 0(nlog 2 n) time. 

All these algorithms are cost optimal, as it can be easily shown that 12 (n log n) is 
the lower bound for these problems (which follows from the lower bound for closest 
pair/element distinctness [9]). However, it may be possible to further simplify them. 

In addition, we discuss parallelization of the SARNN problem using the mono- 
tone matrix method. If an O(logn) time, optimal parallel algorithm is discovered 
for the monotone matrix problem, we show that this will also result in an optimal 
parallel algorithm for the SARNN problem which will take 0(log 2 n) time and does 
0(n logn) work. 

Discovering an optimal parallel algorithm directly for SARNN problem is open. 
Discovering an optimal parallel algorithm for monotone matrix problem is, in itself, 
an interesting problem. 
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Appendix A 

Complete Algorithm for the 
SARNN problem in the L oo metric 

Here, too we make the same assumptions as in Chapter 3. To find the SARNN 
graph of the ENWS region pair, in the Loo metric, we do the following : 

First, we sort the points by their X-coordinates. The sweep line is vertical and 
proceeds from right to left. We assume that the sweep line intersects only 1 point 
at a time i.e. no two points have the same X-coordinate. At each event point, we 
find the set of points on the data structure which could be the SARNN for the event 
point, i.e. which lie in the EN region of the event point. This can be checked by 
the following 2 conditions : 

1) Vj > Hi and 2) (yi — Xi) > (yj — Xj) where the event point is (x i: iji) and the point 
to be checked is ( Xj,yj ). 

Each such point is deleted from the structure after checking its L ^ distance to 
the current event point. After all the points in the EN region of the event point 
have been processed, the point with the least L x distance from the event point is 
set as its SARNN. Then, the event point is added to the data structure and the 
sweep line proceeds to the next event point. This is done until the sweep line goes 
past the last point. At this stage, the points still on the data structure are deleted 
as they don’t have a neighbour in the WS direction. The correctness of this method 
follows from Lemma 3.2. 
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Appendix B 

Complete Algorithm for the 
ARNN problem in the L\ metric 


Here, we assume that k — 8. Hence, the plane around each point is to be divided 
into eight regions. Now here, we don’t pair regions like in the SARNN problem. 
Instead, the graph for each region is found separately and then merged later on to 
get the complete ARNN graph. To find the ARNN graph of the WS region, we do 
the following : 

We first sort the points in the direction of the line with slope 1. i.e sort them 
by the sum (y* + Xi). Then we proceed from the point which has greatest value 
of (yj + towards the point which has lowest value of (y* + Xi). The sweep line 
has slope -1. We assume that the sweep line intersects only 1 point at a time. At 
each event point, we find the set of points on the data structure which could be the 
ARNN for the event point, i.e. which lie in the EN region of the event point. This 
can be checked by the following 2 conditions : 

1) Uj > Vi and 2) (yj — Xj) > {yj — Xj) where the event point is ( x^yi ) and the point 
to be checked is ( Xj,yj ). 

Each such point is deleted from the structure after setting its ARNN in the WS 
direction to be the current event point. After all the points in the EN region of the 
event point have been looked at and deleted, the event point is added to the data 
structure and the sweep line proceeds to the next event point. This is done until 
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the sweep line goes past the last point. At this stage, the points still on the data 
structure don’t have a neighbour in the WS direction and hence they can be deleted. 
The correctness of this method follows from Lemma 3.1. 

We can find the ARNN graphs of the rest of the region by suitable rotation of 
the axes. These graphs are then merged to get the complete ARNN graph. 
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Appendix C 

Complete Algorithm for the 
ARNN problem in the Lqq metric 


Here, too we make the same assumptions as in Appendix B. To find the ARNN 
graph of the WS region in the L 0 0 metric, we do the following : 

First, we sort the points by their X-coordinates. The sweep line is vertical and 
proceeds from right to left. We assume that the sweep line intersects only 1 point 
at a time i.e. no two points have the same X-coordinate. At each event point, we 
find the set of points on the data structure which could be the ARNN for the event 
point, i.e. which lie in the EN region of the event point. This can be checked by 
the following 2 conditions : 

1) Uj > Hi and 2) (yi — Xi) > (yj — Xj) where the event point is (x,, yi) and the point 
to be checked is (. Xj,yj ). 

Each such point is deleted from the structure after setting its ARNN in the WS 
direction to be the current event point. After all the points in the EN region of the 
event point have been processed, the event point is added to the data structure and 
the sweep line proceeds to the next event point. This is done until the sweep line 
goes past the last point. At this stage, the points still on the data structure are 
deleted as they don’t have a neighbour in the WS direction. The correctness of this 
method follows from Lemma 3.2. 
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